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0 .   ABSTRACT 

A  scheme  for  obtaining  a  stationary  sequence  of  discrete 
random  variables  which  has  p-th  order  Markov  dependence  and  a 
specified  marginal  distribution  is  presented.   This  DAR(p)  process/ 
which  is  a  particular  p-th  order  Markov  chain,  has  the  physical 
and  correlation  structure  of  an  autoregressive  process  of  order  p. 
The  process  and  its  transition  matrix  are  determined  by  the 
specified  marginal  distribution  and  by  several  other  parameters 
which,  independently  of  the  marginal  distribution,  determine  the 
correlation  structure.   Correlational  properties  and  initial 
conditions  for  stationarity  of  the  process  are  studied.   Asymptotic 
properties  for  the  process  are  also  obtained. 
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1.   INTRODUCTION 

In  Jacobs  and  Lewis  (19  78a)  a  scheme  for  obtaining  a 
sequence  of  discrete  random  variables  with  first  order  Markov 
dependence  and  with  a  given  marginal  distribution  is  presented. 
The  process  is  a  Markov  chain  and  has  the  physical  and  correlation 
structure  of  a  first  order  autoregressive  process.   In  this  paper 
we  extend  this  result  and  present  a  scheme  for  obtaining  a 
stationary  sequence  of  discrete  random  variables  with  p-th  order 
Markov  dependence  and  having  a  given  marginal  distribution.   The 

process  is  a  p-th  order  Markov  chain  and  has  the  physical  and 

correlation  structure  of  a  higher  order  autoregressive  process, 

i.e.  X   depends  explicitly  on   X   ,,..., X    .   As  with  the  models 
n    v  v  j  n-1      n-p 

in  Jacobs  and  Lewis  (1978a)  the  process  is  specified  by  the 

stationary  marginal  distribution  of   X    and  several  other  chosen 

1  3  n 

parameters  which,  independently  of  the  marginal  distribution, 

determine  the  correlation  structure. 

This  DAR(p)  process  is  defined  as  follows.   Let   {Y  }  be  a 

n 

sequence  of  independent  identically  distributed  random  variables 
taking  values  in   E   a  discrete  subset  of  the  real  line  with 
distribution   tt  .   Let   (V  }   be  a  sequence  of  independent 
{0,l}-random  variables  with 

P{V   =  1}  =  1  -  P{V   =  0}  =  p 

for   0  <_  p  <  1.   Let   {A  }  be  a  sequence  of  independent  random 
variables  taking  values  in   {l,2,...,p}   with   P{A  =  i}  =  a.  ^  0, 
i  =  l,2,...,p  where  l- --\    a-    =   ■*-•   Put 

2 


(1.1)  X   =  V  X   _   +  (1  -  V  )Y 

n    n  n-A  n   n 

n 


for   n  =  1,2,...  .   Then   {X  }   is  called  the  DAR(p)  process 

(discrete  autoregressive  process  of  order  p) .   It  will  be  shown 

in  Section  3  that  it  is  possible  to  obtain  an  initial  distribution 

for   (X   ,,,  ...  ,  X,,  XA)   which  makes  the  sequence   {X  } 
— p+J-  i    u  n 

stationary  with  marginal  distribution   it. 

In  Section  2  we  will  study  the  correlational  structure 
of  the  stationary  DAR(p)  process,  obtaining  Yule-Walker  type 
equations  and  discussing  their  solution  for  the  DAR(2)  and 
DAR(3)  process. 

Section  3  discusses  the  asymptotic  properties  of  the 
DAR(p)  process;  it  is  shown  that  the  process  is  <j)-mixing  in  the 
sense  of  Billingsley  (1968)  and  consequently  that,  with  the  usual 
definitions,  sample  means,  covariances  and  quantiles  converge 
almost  surely  and  are  asymptotically  normal. 

The  process  as  defined  at  (1.1)  has  the  property  that 
all  of  the  serial  correlations   r(k)  =  corr (Xn+k ,X  ) ,  k  =  + 1 , +2  ,  .  .  . 
are  greater  than  or  equal  to  zero.   This  problem  can  be  overcome 
quite  simply  if   it   is  a  symmetric  distribution.   The  process  then 
becomes  a  full  analog  to  the  standard  linear  AR(p)  process  (see 
e.g.,  Box  and  Jenkins,  1970).   A  discussion  of  the  correlation 
properties  of  this  extended  DAR(p)  process  is  given.   When   tt   is 
not  symmetric  it  is  necessary  to  consider  schemes  such  as  coupling 
two  DAP(p)  processes  over  a  bivariate  sequence   {Yn,  Y^} ,  where 


the   Y  's  and   Y''s  are  negatively  correlated.   Since  this  leads 
n  n  n  jl 

into  the  question  of  multivariate  DAR(p)  processes,  it  will  be 
explicated  elsewhere. 


2.   CORRELATIONAL  PROPERTIES 


Let   (X  }   be  a  stationary  DAR(p)  process  having  marginal 


n 


distribution   tt   and  parameters   p,  a-f    i  =  l,...,p,  and  let 

m=E(Y)  =E(X).   From  (1.1)  it  follows  that  the  dependence  in 

the  DAR(p)  process  is  due  to  the  random  selection  procedure  by 

which   X   chooses  one  of  the   Y,  ;  k  <  n.   In  other  words  it  is 

n  Im- 

possible to  write   X   as  a  random  index  model  over   Y,  ,  k<_  n, 

although  the  indexing  variables  for  each   n   are  correlated.   To 

derive  the  correlation  structure  it  is  easier  to  work  with  the 

autoregressive  formulation  (1.1).   Note  that  with  probability 

pa. ,  X  ,,  =  X  .,   .   for   j  =  l,...,p.   This  observation  allows  one 
j        n+k    n+k-j        J     '    '* 

to  derive  Yule-Walker  equations  for  the  serial  correlations 

r(k)  =  Corr(X  ,  ,X  )   quite  simply.   For  example,  if  we  center 

the   X  ' s  to  give   X1  =  X   -  m,  then  multiplying  (1.1)  by   X'  , 
n       ^      nn  r  ■*   ^        J         n-1 

and  taking  expectations 


r(l)  =  E(X^X^_X) 

=  E(X'   V  X'    )  +  E(X'   (l-Vn)YM-m) 
n-1  n  n-A        n-1    n  n 
n 

=  pa1r(0)  +  pa2r(l)  +  •••  +  pa  r(p-l). 

Continuing  this  way,  the  complete  set  of  Yule-Walker 
equations  is  obtained: 


(2.1)  r(l)  =  pa1r(0)      +  pa2r(l)      +  •••  +  pa  r(p-l)  , 

(2.2)  r(2)  =  pa1r(l)      +  pa2r(0)      +  •••  +  pa  r(p-2)  , 


(2.3)  r(p)  =  pa  r(p-l)    +  pa2r(p-2)    +  •••  +  pa  r(0)   , 

and,  for  k  >_  1 , 

(2.4)  r(p+k)  =  pa,r(p+k-l)  +  pa2r(p+k-2)  +  •••  +  pa  r(k) 

Clearly,  when    p  =  0   all   r(k)*s/  k  =  1,2,...   are  zero,  so 

that  one  is  interested  in  solving  the  equations  for   0  <  p  <  1. 

To  see  that  the  serial  correlations  are  all  positive  or 

zero,  let   q(i)   be  the  probability  that   X   and   X   .   choose 

'  n        n+i 

the  same  random  variable   Y,  ,  where,  because  of  the  backward 

k 

definition  of  the  autoregression,  k  <_  n.   Then  q(i)   also  satisfies 

equations  (2.1)-(2.4).   Hence   r(i)  =  q(i),  i  =  1,2,...   and  the 
correlations  for  the  DAR(p)  process  are  always  nonnegative. 
If  we  define  a  generating  function 


3  (z)  =  I       7?    r(j)    for   0  <_  z  , 
j=l 


then  from  (2.1)  -  (2.4)  we  have  that 


(2.5)    B(z) 


2 

=  (z[pa,r(0)  + +  pa  r(p-l)]  +  z  [pa9r(0)  +•••+  pa  r(p-2)] 

l  p  z  p 

+  zP[pa  r(0)]}  {1  -  zpa,  -  z  pa9~  '••  ~zPpa  } 
P  i        z  p 


The  roots  of  the  characteristic  equation 


(2.6)  1  -  zpa,  -  z  pa2  -•  •  •  -  z^pa   =  0 


can  be  used  to  expand   3 (z)   by  partial  fractions  to  obtain  expres- 
sions for   r(k).   The  roots  can  also  be  used  to  characterize 
the  behavior  of  the  autoregressive  function   (r(k)}.   This  will  be 
illustrated  for  the  DAR(2)  and  DAR(3)  processes. 

First  we  note  that  since   p   and  the   a. 's  are  greater 
than  or  equal  to  zero,  the  left-hand  side  of  (2.6)  equals  1  at 
z  =  0,  (1-p)  >  0   at   z  =  I,  and  since  it  is  monotone  in   z 
for   z  >_  0 ,  there  is  one  and  only  one  positive  root  for  a  value 
of   z  >  1.   All  other  roots  are  negative  or  imaginary.   (For 
p  =  1,  the  root  is   z,  =  1/p.) 

Another  consequence  of  (2.6)  concerns  the  quantity   c 
defined  in  Jacobs  and  Lewis  (1978b).   We  have 

(X,  +•  •  •+  X  \ 
)  =  var(X,  )c, 
m      /         1 


where 


c  =  1  +  2  I      r(j) 


From  (2.5) 

(2.7)     c  =  1  +  23(1) 

=  l  +  2p{r(0)  +  (l-o^)  r(l)  +  (i-a-a  )  r(2) 
+•••+  (1-c^  a    )  r(p-l) }/(l-p) 

1  +  P-   • 
-     1-p 

The  inequality  follows  because  all  of  the  serial  correlations 
r(k)   are  nonnegative. 

Example  2.1.   The  DAR(2)  process. 

In  the  definition  (1.1)  let   p(a  =1}  =  1  -  P(A  =2}  =  a 

n  n 

for   0  <_  a  <_  1 .   If   a  =  l   the  DAR(2)  process  reduces  to  the  DAR(l) 
process  which  was  studied  in  Jacobs  and  Lewis  (1978  a,b) .   The 
Yule-Walker  equations  are  easily  solved  to  give 


(2.7)  r(l)  =  pa[l  -  p(l-a)  ]_1 


and 


(2.8)      r(2)  =  (p(l-p)  +  2pa[p-l]Hl  -  p  (1-a)  }  Y 


=    r(l)  [  (l-p)a  1    +   2p  -  1]  . 


2 
If   a  =  1,  r(l)  =  p,  r(2)  =  p  ,  ..    as  it  should  for  a 

first  order  Markov  chain;  if  a   =    0,  then   r(l)  =  0,  r(2)  =  p, 

2 
r(3)  =  0,  r(4)  =  p  ,  ...  .   To  obtain  possible  values  of   r(l) 

and   r(2)   note  that  from  (2.7) 


r(l)  (l-r(l))  1  =  ap(l-p)  1    . 


Thus  for  fixed   r(l)  ,   p   must  be  greater  than  or  equal  to   r(l) 
since   0  <  a  <  1.   Further,  for  fixed   r(l)   and   p  >  r(l) 


(2.9)  a  =  r(l)  [1  -  r(l)  ]  1    (1-p)  p  X 


Suppose   r(l)   is  fixed  and   p  >^  r(l);  then  substituting  (2.9) 
into  (2.8)  yields 

r(2)  =  -r(l)  +  p[l  +  r(l)]  >  r(l)2  . 

It  now  follows  that  the  admissible  region  for   r(l)  and   r(2)   is 
1  >  r(2)  >_  r(l)2. 

Note  that 

to    in\                      r(l)  +  r(2) 
(2.10)  1  +  r(l)  =p 

and 

r(l)  [1  +  r(2)] 


pa  =  T 

1  -  r(l)Z 

Hence,  it  is  possible  to  obtain  estimates  for   p   and   a   from 

estimates  of  the  first  two  serial  correlations. 


To  study  the  autoregressive  function  note  that  the 

characteristic  equation  (2.6)  becomes 

2 

1  -  zpa  -  z  p (1-a)  =  0 


which  has  two  real  roots,  for   0  <  a  <  1, 


Zl  2  =  ^pa  -  aH-2p(l-a)} 


where 


a  =  [(pa)2  +  4p(l-a)]1/2 


2  1/2 
Since   a  >_  [(pa)  ]     =  pa,  one  root  is  positive,  and  the  other 

is  negative.   Also  the  negative  root  is  dominant  in  absolute  value 
Thus,  the  autocorrelation  function  for  the  DAR(2)  process  is  non- 
negative  and  is  a  mixture  of  the  larger  inverse  of  the  positive 
root  decaying  geometrically  and  the  smaller  (in  absolute  value) 
inverse  of  the  negative  root  decaying  in  absolute  value 
geometrically  but  oscillating  in  sign  .   Specifically, 


(2.12)  r(k)  =  b1z1k  +  b2z2k 


for   b,   and  b~   given  as 

b1   =  z~  [1  -  z~  ]  { (z~   -  z~  )  (1  +  z~  z2  )  } 


b2  =  z2  (1  -  z1    )     {{z1      -  z2  ) (1  +  z±    z2  )} 


Eventually  the  larger  inverse  root   z»    dominates  in   r(k)   and 
the  decay  is  geometric. 

10 


Example  2.2.    The  DAR(3)  process. 
In  definition  (1.1)  let 


P{A  =1}  =  a,,     P{A  =2}  =  a0,     P{A  =3}  =  1  -  a,  -a, 
n        1         Ti  z  n  i  z 


for   0  <_  a,  ,  0  <_  a? ,    0  <_  a,  +  a~  <_  1 .   If   a,  +  a~  =  1   then  the 
process  reduces  to  the  DAR(2)  process.   The  Yule-Walker  equations 
become 


(2.13)   r(l)  =  pc^        +  pa2r(l)    +  p(l  -  a   -  a  )r(2); 


(2.14)   r(2)  =  paxr(l)    +  pa2       +  p(l  -  c^  -  a2)r(l); 


(2.15)   r(3)  =  paxr(2)    +  pa2r(l)    +  p(l  -    a±    -    a2) ; 


(2.16)   r(k)  =  pa1r(k-2)  +  pa2r(k-2)  +  p(l  -    a±    -   a2)r(k-3) 


for      k    >    3.      Thus    from    (2.13)    and    (2.14) 


2  2  -1 

r(l)    =    [pa,  +  p     (1-     a     -     a2)a    ][1    -    pa2    -    p    (1  -     a ..  -  a2)  (1  -  a2)  1 


r(2)    =    pa2    +    p(l    -a2)r(l); 


and 


2  2 

r(3)    =    p    ct,a2    +    p  (1    -    a,    -    a2)    +    {p    a,  (1    -    a»)    +    pa2>r(l)     . 


11 


u  ~  =  p  a  ~  , 


Since  equations  (2  .  13)  -  (2 .  16 )  are  linear  in   a,1  =  pa,, 

and   p   with  coefficients  involving   r(l),  r(2)  and   r(3),  they 


can  be  solved  for   a,'  , 


al      and   p,  and  thus  for   a,  , 


a  ~   an  d   p 


To  facilitate  this  note  that  (2 . 13) - (2 . 15)  can  be  rewritten  as 


(2.13')     r(l)  =  a|{l-  r(2)}  +  a£[r(l)  -  r(2)]  +  pr(2); 


(2. 14')     r(2)  = 


a£[l  -  r(l)  ]     +  pr(l)  ; 


(2.15')     r(3)  =  aj_[r(2)  -  1]  +  a£[r(l)  -  1]    +  p  . 

Hence  estimates  for   p,  a, ,  and   a~   can  be  obtained  from  the  usual 
estimates  of  the  serial  correlations   r(l) ,  r(2)   and   r(3) . 
Problems  will  arise  in  solving  the  equations  (2 . 13 ' ) - (2 . 15 ' )  if   p 
is  close  to  zero,  since  when    p  =  0   all  the  quantities  in  the 
equation  are  identically  zero. 

The  characteristic  equation  (2.6)  becomes 


2  3 

(2.17)  0=1-  pa, z  -  pa2z   -  p(l  -  a,  -  a2) z 


Again  we  require  that   a,  +   a?  <  1 ,  so  that  the  process  does  not 
reduce  to  the  DAR(2)  case.   To  find  the  roots  of  (2.17),  let 


1 

a  =  3 


a- 


a. 


1  -  a- 


1  -  a,  -  a? 


1  K(   ^2. 

H      2\I~=~aT~- 


-  9 


ala2 


(1  -  a,  -  a2) 


-  27 


p  (1  -  a,  -  a0 ) 


12 


and 


^i  -  b   4.  a 

d  "  X  +  27  • 


The  value  of   d   can  be  either  positive  or  negative.   If   d  >  0, 
equation  (2.17)  has  one  real  positive  root  and  two  imaginary 
roots  and  the  autocorrelation  function  will  have  a  "damped  sine 
wave"  appearance,  but  the  function  will  eventually  decrease 
geometrically.   If   d  <  0,  then  all  the  roots  of  the  cubic  are  real 
and  unequal,  one  being  positive  and  greater  than  one,  the  other 
two  being  negative.   Hence  the  autocorrelation  function  is  a 
mixture  of  a  geometrically  decreasing  term,  and  two  terms  which 
decrease  geometrically  in  absolute  value  but  oscillate  in  sign. 
Table  1  gives  values  for  the  first  nine  correlations  for  a 
DAR(3)  process  with   p  =  0.9   for  various  values  of   a,   and   a-. 

Example  2.3.   The  DAR(p)  process. 

The  Yule-Walker  equations  (2.1) -(2.4)  can  be  reparametrized 
with   (})1=pa1,  <J) 2  =  pa2,...,  cj>   =  pa  ,  the  parameters  all  lying 
between  0  and  1,  with  the  constraint  that 

(2.18)  0  <_   <J).  +  cj>2  +  •••  +  <f>   <  1  . 

Then  it  can  be  seen  that  the  Yule-Walker  equations  are  in  exactly 
the  same  form  as  those  given  for  the  usual  AR(p)  process  in  Box 
and  Jenkins  (1970,  p.  55).   The  parameter  space  for  the  DAR(p) 
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process  is  more  restricted  than  that  for   <J>.  ,  .  .  .  ,  (J>    in  the 

1  p 

AR(p)  case,  but  the  solutions  are  the  same.   The  parameter 
does  not  appear  explicitly  in  this  parametrization;  however 
the  parametrization  is  useful  in  the  next  section  and  when 
negative  correlation  is  considered. 
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3.   ASYMPTOTIC  PROPERTIES 

3.1.   Preliminaries:   Aperiodicity  and  the  Doeblin  Condition 

Let   {X  }   be  a  real-valued  DAR(p)  process  with  discrete  state 
n 

space   E   having  parameters  p  and  {a.;  i  =  l,2,...,p}   and  with 

distribution   tt   for  the  error  sequence   Y  .   From  the  definition 

(1.1)  it  follows  that   Zn  =  (Xr,  Xr+1,  ...  ,  X   _]_)  is  a 

Markov  chain  having  state  space   F  which  is  the  pth-order  product 

space  of   E   with  itself.   Let   P   denote  the  transition  matrix 

of   {Z  }.    Then  for   i,  k  6  F,  k  =  (kw...,k  ),  i  =  (i,,...,  i  ) 
n  ~  ~      ~      _l      p   ~      ±       p 

(3.1)  P(i,k)  >  (l-p)p  7T(k,)-..  7r(k  )  . 

~  ~  —  x        p 


For   C   a  subset  of   F   let 


y(C)  =  I      ir(k  )  .-•  7r(k  )  =  I      y(k)  . 
k^C     x         p    k6C 


The  asymptotic  theory  for  DAR(p)  processes  is  based  on  the  follow- 
ing result. 


PROPOSITION  (3.1).   The  Markov  chain   {Z  }   is  aperiodic  and 

n       r 

satisfies  the  Doeblin  hypothesis   D   as  generalized  by  Doob 
[p.  192,  [1953]]. 
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PROOF.   Fix  a  subset   C   of   F   such  that 


6  =  inf  y(k)  >  0  . 
k€C 


Let      A     be    a   subset   of      F      with      y  (A)    <    y(C)/2.      Then 


P(i,    AC   n    C) 


P(i,k) 

k6ACflC     v(U 


>        Ic        d-P)F   H<k> 
k€A  nc 


=    (l-p)P    y(AC    fl    C) 

>  (l-p)P    y(C)/2 

>  (l-p)P6/2    , 

since      y(A)     <    y(C)/2.      Hence, 


(3.2)  P(i,A)    =    1    -   P(i,    A°) 


<    1    -   P(i,    AC   n    C) 


<_  1    -   \    (1    -    p)2    6 


The  result  now  follows. 

We  will  use  proposition  (3.1)  to  show  existence  of  a 
unique  limiting  distribution  for  {Z  },  and  to  show  that  the 
stationary  DAR(p)  process  is  (J)--mixing. 
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3.2.   Stationarity  Conditions 

It  follows  from  Proposition  (3.1)  that  there  exists  a 
unique  limiting  distribution  v  for   {Z  }   and  constants 
Y  >  0   and    3,  0  <  3  <  1   such  that 


(3.3)  |Pn(v,A)  -  v(A)  |  <_   Y3n  ,       n  =  1,2,... 

for  all  subsets   A   of   F  (cf.  Doob  [1953,  p.  221]).   a 
stationary  DAR(p)  process  is  obtained  by  using   v   as  the  initial 
distribution  for   (X_    ,  X_   2 ,  ...  ,  XQ).   To  see  that   v 
has  marginal  distribution   it,  let  {y  (j),  j  £  E}   denote  the 
marginal  distribution  of  the  first  component  of   Z    and  put 

y(j)  =lim     y    (j).   Note  that 

'  J        n  ->-  °°   n  J 

lim   Yn(j)  =   I   Pai   lim   Yn_i  ( j )  +  (1-p)  ir(j)  . 

n  ■>  °°         i=l     n  •>  °° 


Hence,   y(J)  =  PY(J)  +  d-P)  tt  ( j )   and  the  marginal  distribution 
of  the  stationary  DAR(p)  process  is   tt   as  asserted  in  Section  1 


Example  3.1.   The  DAR(2)  process. 

Let   {X  }   be  a  DAR(2)  process  with  the  same  oarametrization 
n 

as  in  Example  2.1.   If 


(3-4)  y(s1,s2)    =     lim    E[exp{-SlXn+1    -    s2Xn+2}] 

and 


n  +  °o 


3(s1)    =     lim    Efexpi-Sj^Xj^}  ] 


n  ■*■  °° 
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then      Y(s,,s2)       satisfies    the   equation 


(3.5)         y(s1,s2)    =    (1-p)     fris^     3(s2)    +    pag  (s1  +  s2)  +  p(l-a)     y(s;L,s2) 


Solving  (3.5)  we  get 


Y(s1,s2)  =  y(s2,s1)  =  (1  -  r(l))  3(s1)  B(s2)  +r(l)  B^  +  s^ 


and  inverting  the  transform  results  in 


(l-r(l))   TT(i)   TT(j),  i^j. 


(3.6)    v(i,j)  =  lim  P{Xn+1=i ,XR+2=j )=< 


n  -*-  oo 

r(l)  tt(j)  +  (l-r(l))  tt  (  j  )  ^  ,  i=j 


where   r(l)  =  ap{l  -  p(l-a)}    =  corr(X  ,X   ,  )  in  the  stationary 
process.   Note  that   )^.    v(i,j)  =  7  .  ,   v(j,i)  =  tt  ( i )  ,  as  expected, 

1  til)  1  tlj 

In  simulating  a  random  variable  with  distribution  v  it  is 
simpler  to  use  the  conditional  distribution 


/  {1  -  r(l)  }  tt(j)  ,       j  ±   i, 
(j|i)  =  lim   P{Xn+2=j|Xn+1=i}  = 

n">°°  [     r(l)  +  (l-r(l))  tt(I)  ,   j  =  i. 


Note  that  we  can  now  generate  a  stationary  sequence  of  random 
variables   X  ,  n  =  1,2,...   with  any  marginal  distribution  tt  and 
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second-order  autoregressive  (AR(2))  correlation  structure.   From 

(3.6)  or  the  conditional  distribution  it  is  not  hard  to  show 

that,  with   m  =  E(Y  )  =  E(X  ) , 
'  n       n 


(3.7)  E[Xn|Xn_1    =    xn_1]     =    {1    -    r(l)}m    +    r(l)xn_1, 

(3.8)  E[Xn|Xn+l   =   xn+l]    =    {1    "    r(1)}m   +   r(1)xn+l' 

(3.9)  Var[X    lx„    .    =  x„    ,]    =    {1   -    r(l)}    E(X2)    +    r (1) {1  -  r (1) }x2    . 

n      n— 1  n-1  n  n-1 

-  U-r(l)}2m2  -  2r(l)  (1-  r(l))mxn_1. 
Directly  from  (1.1)  we  have,  for  example,  that 


(3.10)    Efx|x   ,  =  x   ,  ;  X   0=x   „] 

n'  n-1    n-1'   n-2     n-2 

=  paxn  n  +  p(l-a)x^  „  +  (l-p)m; 
n-1  n— 2. 


(3.11)  Var [X  |X   .  =  x   ,  ;  X  n    =    x      „] 
n   n-1    n-1   n-2     n-2 

=  (1-p)  Var(Y J  +  pad-palx^+pfl-a)  [l-p(l-a)  ]  x2 


2       2 

]  +  2p  a (1-a) x   ,  x 
n      K         n-1  n 


+  (1-p)  pE[Y2]  +  2p2a(l-a)x   ,x 


+  2p(l-p)m[ax   ,  +  (l-a)x  ]  . 
K      n-1  n 


The  regressions  (3.7)  and  (3.8)  might  suggest  that  the 
DAR(2)  process  is  time-reversible,  but  this  is  not  so.  To  see 
this,  note  that 
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Y(Sl,s2,s3)    =     lim    E[exp{-SlXn+1   -    s2Xn+2    -    83X^3}] 


n  -> 


pay  (s, ,s2+sj  +  p(l-a)  y(s,+s3,s2) 


+  (1-p)  6(s3)  y(s1#s2) 


Since   y  (s-,  ,  s- ,  s-J  ^  y(s-.,s2/s,)  it  follows  that  the  stationary 
DAR(2)  process  is  not  time  reversible. 


Example  3.2.  The  DAR(3)  process. 

Let   (X  }   be  a  DAR(3)  process  with  the  same  parametrization 
n 

asinExample  (2.2).   As  before  let 


Y(s  ,s2)  =  lim  E[exp{-s  X     -  s2X    }]  , 
n  ->  00 


and  let 

3^)    =     lim      E[exp{-S;LXn}]  ; 


n 


6(s1,s2)    =     lim    E[exp{-s1Xn_1   -    s2X         }  ]  ; 

n  ■*•  °° 
and 

T(s    ,s      s    )    =     lim    E[exp{-{SlXn+1    -    s   X  -    S3X        }] 

n  -*■  °° 


Put   <J>,  =  pa,,  (J) 2  =  Pa2'  an<^   <K  =  p(1  ~  a-i  ~  a2^  •   Then,  from 
the  definition  (1.1),  and  accounting  for  the  possibility  of 

X  ,  0   taking  values   X  ,~,  X  ,,,  X    or   Y  , -. ,  we  have 
n+3        ^  n+2    n+1    n       n+3 
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(3.12)  r(s1,s2,s3) 

(1-p)    B(s,)    y(St»s9)    +    01y(s1,s2+s3) 


Also 


+    02y  (s1+s3,s2)    +    (f>3    r(s3,s1,s2) 


(3.13)  y(S;l/s2) 


=    (1-p)    3(s1)    8(s2)    +    <j)    3(s1  +  s2)    +    4>2y(s2,s1)  +  <()35(s1/s2) 


and 


(3.14)  5(sirs2) 


=    (1-p)    fris^     3(s2)  +  <J>1y(s1,s2)    +    (j)2y(s1  +  s2)  +  <J>3y(s2,s1) 


Solving    (3.13)    and    (3.14)    simultaneously    for      y(si/s2^       an< 
6(3-1,32),    it    follows   that 


Y(s1,s2)    =    (1   -   r(l))    3(s1)    3(s2)    +   r(l)    3(s1   +   s2)    , 


where   r(l)  =  corr(X  ,X    )   is  given  in  Example  (2.2).   Hence, 

n  n+1  c 

inverting  the  transform, 
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(l-r(l))   TT(i)   7T(j)   ,  if   i/j, 

r(l)  tt(j)  ,  if   i  =  j. 


The  expression  (3.12)  can  now  be  used  to  obtain  an  expression 
for    r  (s.,,  s-j/S-)  /  and  substituting  back  into  (3.12)  we  get 


(3.16)       r(s1/s2,s3) 

=    {l-^}"1    {  (l-p)  [3(s3)y(s1,s2)    +   (j)36(s2)y(s3,s1) 


2 

+    <j>33  (s1)  y  (s2,s3)  ]    +    (J^  [y  (s1/s2  +  s3^    +    ^3Y  ^s3 'si  +  s2 


2 

+    (})3Y  (s2,s3  +  sx)  ]    +    <\>2  [y  (s1  +  s3,s2)    +    (j>3y  (s3  +  s2,s1) 


2 
+    <f>3Y  (s2  +  s1,s3)  ]  } 


Note  that   Ms^s^s^  f   r(s  ,s2,s.,)   in  general  and  hence  the 
stationary  DAR(3)  process  is  not  time  reversible. 

3.3.   Mixing  Results 

The  following  result  follows  directly  from  (3.3). 

PROPOSITION  3.2.   The  stationary  DAR(p)  process  is  4>-mixing  in 
the  sense  of  Billingsley  [1968]  with   <)>  (k)  =  y3  . 
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This  last  result  implies  that  all  of  the  asymptotic 
results  in  Jacobs  and  Lewis  [19  78b]  are  true  for  the  DAR(p) 
process  modulo  constants.   In  particular  the  usual  estimates 
of  covariances,  percentiles  and  quantiles  converge  almost  surely 
to  the  true  covariances,  percentiles  and  quantiles  and  the  esti- 
mates are  asymptotically  normal.   Further,  a  chi-square  goodness 
of  fit  test  can  be  obtained  for  the  marginal  distribution  of 
the  stationary  DAR(p)  process.   For  further  details  see  Jacobs 
and  Lewis  [19  78b] . 
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4.   EXTENSIONS 

One  possible  drawback  of  the  DAR(p)  process  as  defined 
in  (1.1)  is  that  all  of  the  correlations  are  positive.   We  will 
now  discuss  a  variation  of  the  DAR(p)  process  which  allows 
negative  correlation  for  the  case  in  which  the  marginal  distri- 
bution  tt   is  symmetric  about  zero.   The  case  of  nonsymmetric   tt 
is  more  difficult  and  will  be  considered  elsewhere. 

Let   {V  } ,  {A  } ,  and   {Y  }   be  as  in  Section  1  and 
n     n  n 

suppose  that   tt   is  symmetric  about  zero.   Let 


(4.1)         X   =  V  aA  X   _   +  (1  -  V  )Y   , 
n    n  A  n-A  n   n 

n    n 

where   ^v?  k=l,...,p}   is  a  fixed  sequence  of  numbers  that 

are  either  -1  or  1. 

Put   Z   =  (X  ,  X  ,- ,  . . .  ,  X  .   ,).   Then   {Z  }   is  a 
n     n'   n+1         n+p-1  n 

Markov  chain  with  a  transition  matrix   P   which  satisfies  (3.1) 
Thus  Propositions  (3.1)  and  (3.2)  still  hold,  as  well  as  (3.3). 

Let   {X  }   be  a  stationary  process  defined  by  the 
modified  DAR(p)  scheme  of  (4.1)  and  put   r(k)  =  corr (X  ,  X   ,  ) . 
The  Yule-Walker  equations  for  this  process  are 
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(4.2)  r(l)    =   <$>1  +    <J>2r(l)  +•••+    <j>   r(p-l); 

(4.3)  r(2)    =   cf^rU)  +   <$>2  +  •••  +    <J>   r(p-2); 


(4.4)  r(p)    =   (j)1r(p-l)         +   <})2r(p-2)         +•••+   cf>    r(0); 

(4.5)  r(p+k)    =   cj)    r(p+k-l)    +    4>2r(p+k-2)    +  •••  +    cj>    r(k)     ; 


for      k    >_  1      where      cf>,     =   a,  pa,  ,    k   =    l,...,p. 

Equations  (4.2)- (4.4)  have  the  same  form  as  the  Yule- 
Walker  equation  on  page  55  of  Box  and  Jenkins  (1970)  but  the 
set  of  possible  values  of  § . ,  i  =  l,...,p   is  more  restrictive 
In  particular  we  always  have  that 


(4.6)  U1l    +    |<J>2I    +•••+    |<j>    |    <    1 


Example  4.1.    The  modified  DAR(l)  process. 

Let  A   =  1  and   a,  =  -1.   Then 
n  1 


X  _,,  =  -V  ±1X   +  (1  -  V  _,_.  )Y  J_. 

n+1     n+1  n         n+1   n+1 


k  k 
and   r(k)  =  (-1)  p  .   If   a,  =  1   the  process  is  the  same  as  that 

in  Section  1.   When   a.  =  -1   the  modified  DAR(l)  process  is 

Markovian  with  transition  matrix 
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(  (1-P)   TT(j)  ,         "i^j   , 

p(i,j)  =  P(xn+1  =  Jlxn=i}   ) 

P  +   (1-p)  TT(j  )  ,    -i=j   . 


Example  4.2.   The  modified  DAR(2)  process. 

Let   P{A   =  1}  =  1  "  P(An  =2}=a,0<a<l.   Solving 
equations  (4.2)-(4.4)  we  obtain 


r(l)  =  a1pa[l  -  a^d-a)]""1, 

r(2)  =  p[a2(l-a)  -  p(l  -  2a)]  [1  -  a^U-a)]"1 


and 


2  -1 

r(3)  =  p  a[2a,a2(l-ot)  -  a,p(l  -  2a)]  [1  -  a„p(l-a)] 


Note  that  if   a,  =  1   (respectively   a,  =  -1),  the   r(l)  >  0, 
(respectively   r(l)  <  0)   and   r(2)   can  be  either  positive  or 
negative.   The  analogue  of  the  characteristic  equation  (2.6)  is 


2 
(4.6)  1  -  a, paz  -  a,p(l-a)z   =  0 


If  b.j   =  1,  then  (4.6)  has  two  real  roots;  if   a?  =  -1,  then 
(4.6)  may  have  imaginary  roots.   Thus  the  autocorrelation 
function  can  either  have  a  "damped  sine  wave"  appearance  or  be 
a  mixture  of  terms  which  decrease  geometrically  in  absolute 
value  but  may  oscillate  in  sign. 
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For  the  modified  DAR(2)  process  the  range  of  values 
which  the  parameters   <J>,   and   02   can  take  is  not  as  broad 
as  for  the  AR2  case,  for  which  we  have  (Box  and  Jenkins 
[1970,  p.  58]) 

cj>2  +  $1    <    1, 

-1  <  <\>2    <1, 

which  allows   tj>,   to  range  between  -2  and  +2.   By  contrast  for 
the  modified  DAR(2)  (for  symmetric  distributions)  we  have 


<j)2l  +  u1i   <  1, 

-1  <  (j)1  <  1, 
-1  <  (j>2  <  1. 


Thus  we  have  a  diamond  of  possible  values  in  the   (cj),,^) 
plane,  with  apexes   (0,1),  (1,0),  (0,-1),  (-1,0).   With 
this  restriction  the  behavior  of  the  serial  correlations  for 
the  modified  DAR(2)  process  is  the  same  as  the  behavior  for 
the  AR(2)  process  which  is  pictured  in  the  figure  in  Box 
and  Jenkins  (1970)  . 

A  limiting  stationary  distribution  can  be  derived  for 
this  modified  DAR(2)  process,  as  was  done  for  the  ordinary 
DAR(2)  process  in  Example  3.1. 
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